PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Vocar.0017s0086.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Volvocaceae; Volvox
Family CPP
Protein Properties Length: 2241 aa    MW: 222010 Da    pI: 6.9995
Description CPP family protein
Gene Model
Gene Model ID          Type    Source  Coding Sequence
Vocar.0017s0086.1.p    genome  JGI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End   HMM Start  HMM End
1    TCR     44.3   3.5e-14  1072   1111  2          42
                  TCR    2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkeek 42  
                           ++k+C+Ckks+Clk+YC+Cfaag++C++ C+C +C+N+ e+
  Vocar.0017s0086.1.p 1072 SSKSCRCKKSQCLKLYCDCFAAGQYCGS-CSCISCHNRPEH 1111
                           689*************************.********9875 PP

2    TCR     46.3   8.3e-15  1143   1181  1          39
                  TCR    1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39  
                           k+k+gCnC+ks+ClkkYCeC++ g+kC+ +C+C +C+N 
  Vocar.0017s0086.1.p 1143 KHKRGCNCRKSHCLKKYCECYQGGVKCGIQCTCMECENM 1181
                           589***********************************7 PP
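
The match line between the TCR consensus and the query can be summarized by counting its column types. The sketch below assumes the usual HMMER convention for that line (a letter marks a residue identical to the consensus, `+` marks a positively scoring substitution, a blank marks everything else); the function name `midline_stats` is hypothetical and not part of PlantTFDB or HMMER.

```python
def midline_stats(midline: str) -> dict:
    """Count column types in an HMMER-style match line.

    Assumed convention: letter = identical to consensus,
    '+' = conservative substitution, anything else = other.
    """
    identical = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    other = len(midline) - identical - conservative
    return {"identical": identical, "conservative": conservative, "other": other}


# Match line of domain 2, copied from the alignment above
# (the trailing blank belongs to the last alignment column).
MIDLINE_2 = "k+k+gCnC+ks+ClkkYCeC++ g+kC+ +C+C +C+N "

stats = midline_stats(MIDLINE_2)
print(stats)
```

The three counts always add up to the number of alignment columns, which gives a quick check that the line was copied without losing blanks.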

Protein Features
3D Structure
Database         Entry ID  E-value  Start  End   InterPro ID  Description
SMART            SM01114   9.5E-14  1071   1111  IPR033467    Tesmin/TSO1-like CXC domain
PROSITE profile  PS51634   30.722   1072   1183  IPR005172    CRC domain
Pfam             PF03638   1.3E-11  1074   1108  IPR005172    CRC domain
SMART            SM01114   1.4E-13  1143   1184  IPR033467    Tesmin/TSO1-like CXC domain
Pfam             PF03638   4.7E-11  1145   1181  IPR005172    CRC domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0044212  Molecular Function  transcription regulatory region DNA binding
Sequence
Protein Sequence    Length: 2241 aa
MRRSSPPREP GVGPEGAKAS ARPGLRTVPT MPNKRQAIQG YDVDNSLGSP LFPRDYDQLR  60
NLLGSPLPPL HSPPLFQPSP RRSILTSPAR PAANQRPSNQ IPSNNSHRHN PDSMDALASF  120
FAPSPALPVP LFSPSAAAPN SLFDTSVNRA RADVFTPTPC KSGALDHVTA LLHDVGRGGQ  180
SGAGSDPIGV HLQHQRCNND SAVPTNAATD PHHHEGGSGG SGSGSGSGGT AIPVNVASAS  240
ASAGISCTAQ LAAAALRSQA NKASSSHGFH MIHDGGFGFG FGRSQATPGL MLSLSGQGGF  300
GPIHHPVPGY PSDLLGISGP GSHMFGGSGG AAAGIDGGSG NGGGLYGSFC GIANGGGGGC  360
AGLQLNVKDM LLAHHDDDDR SGGLLTSMPS FCGGGGSGGG GGGFGSGLGL GRPRSRYLDF  420
CTTPRHSADA AASGGAAAAP DATSGDGARA GGSGSSGGVA SNVGAGGGSG SSVGIGEGVG  480
VVGSVEGGRC GGGGIYPGGL SSRHHGGGLL LAPQLPAPPL LQTETNTATT GSNTGQHSRG  540
ADGAAESPPE SLDEPQKPVL SYPSPNEETR IAIAGGCVKV VESGASRTTV ETPGVELGCG  600
PSGGGGGASG GITSTEVAAG PAAAATGTER MATVSASAAG GGGGGGSNGV LVPSSGGSFG  660
LRTESLLVLP GTSVAVPGGG GGGLMVPVSG SGRGGGGNDG GGCDLSSSME ADTMQYSAGG  720
RGSGAGGVCL DRDLGMSSPS LLPPPPLSLG GLMPSSFSVP PHLQPNHHHL QHHHHHHHHQ  780
QQQQQQHYQS SAMTSSMMDL SMARGGSGAG TGASTLMMVS EGGGAMPGLQ PLSQMQMPLC  840
EDDKSFVKRQ IQQQQMQQQQ MQQQQMQQQQ PLVRGGSGTF AVRMASGGSG AGGGGGGGVN  900
GSSGGLLQGR DAAAAVAASG GCERPGVGLP PRSGGGGGGG INVMPGGGMP GAAVAAASVG  960
GAGGGGAGNG GASTPSTLQR PQRTRTASSY GGGGAGGAMN EGTVMSMRGA PSMDFDSDVV  1020
VPELELSPDF PGRGGINANA NPNAHRSSGA GGGLTQIQGG GPNRGRRTSE NSSKSCRCKK  1080
SQCLKLYCDC FAAGQYCGSC SCISCHNRPE HADRVLQRRE DIAARDPQAF TRKIQLAPNG  1140
NGKHKRGCNC RKSHCLKKYC ECYQGGVKCG IQCTCMECEN MDVGSSQEGA GARGALKRGG  1200
AAAKGAGGRA GGGGGGGGSR AGSRRSSATG MYDDYAPSPP LPSTSGCSDG PSPTPSQGTV  1260
PGSVMLQPPP PLASMPSLTV AAAAAAAAAA TTAATTNHFA MSLGSGGDGA AGCTASMPYG  1320
GGHALSAGVV QFSEDGTVRR NSTNSLSHSQ APPVAQPPQL LPPSQQQQQQ LQSQMQSMPA  1380
PLPPNFLRQQ QQQEVQLQPM HSHPHHQQQQ QEQQESPSLI CGEMLQQQQQ QQQQQQQQQQ  1440
QQQQQQQQQQ QQQQQQQQQQ QYYQQQQMVK RSLPPELYGS GSGSDAVARD TCCRGDGGDG  1500
DGEILPGNLR DFQGVVRDEM DEDAEEEEEE EEGDGPSQEQ LGPLKRRRKQ ELGRRTAATP  1560
LPLPSDHPTA PTSESSALAT GGTWAEEARN SAGCNNRRGA AATVAAAHAS DNNADNAYPR  1620
TEGGGMGDMT LAAVGTAGGD LGPSGAGAAA AMAAAAAMPP PPSGQDLRFS LGPEPPGFTP  1680
RGLGISSLDV VSPPPLSMLT HLESDTDSDG GGLEGAGGGL GCRPRRRSAQ HQYRQQYMNT  1740
GGAAVAAAAG GGGGGGGGAA DVPHPSALRR NGGSRHQSHG MVGLDVGVMD FDDSSSALAD  1800
AMITAIADEA SRGPMAGGGE CVATTAGAAG TAAAAAAAAA AAGRHQHRQS GEAAAAAGSD  1860
AATAREGGGV LLCGELLGSG CLLDDGSNDM FLAGFEPNSV EGARGCGSFG FGSGGSGGGF  1920
LSPRFGGMGG GNSSFGLATS PTAFPRSGGV NGSLGLCAVS PQWRVRPPGL GPMGEVAAAV  1980
GPPLGLMSSN SWLHLPHRRP SRFAPTRVNG GSGGGGSSAM SYDPSQLPPW PPVASVTTCT  2040
VGGPLVPELC GLDSGLALKG GHLDMGRAGA AAAAAASVHT LVSPVRTSAM SAAALARRRE  2100
TGGPEGDASR APSYQGAPSG CGDEQRESGP WVVPAAHHHL EAASSPSKQQ RCFMATAAGG  2160
GGNLAAPLQL PQPSAMTPGE QPQFDILTGG SGGGHPRTQP PGGGRGGSRN GGGCAGAAGG  2220
GASANRPPRA GSFVADNGAA *
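
The Start and End coordinates listed for the signature domains can be checked against the numbered sequence block. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive; the helper names `clean_block` and `region` are hypothetical, and only the three sequence lines covering positions 1021 to 1200 are used so that the example stays short.

```python
import re


def clean_block(block: str) -> str:
    """Drop spaces, position counters and a trailing '*' from a numbered block."""
    return re.sub(r"[^A-Za-z]", "", block)


def region(seq: str, start: int, end: int, first_pos: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    first_pos is the sequence position of seq[0], so an excerpt
    can be sliced with the coordinates of the full protein.
    """
    return seq[start - first_pos : end - first_pos + 1]


# Excerpt of the protein sequence above: positions 1021-1200.
EXCERPT = """
VPELELSPDF PGRGGINANA NPNAHRSSGA GGGLTQIQGG GPNRGRRTSE NSSKSCRCKK  1080
SQCLKLYCDC FAAGQYCGSC SCISCHNRPE HADRVLQRRE DIAARDPQAF TRKIQLAPNG  1140
NGKHKRGCNC RKSHCLKKYC ECYQGGVKCG IQCTCMECEN MDVGSSQEGA GARGALKRGG  1200
"""

seq = clean_block(EXCERPT)

# Start/End pairs of the two TCR signature domains.
for start, end in [(1072, 1111), (1143, 1181)]:
    print(start, end, region(seq, start, end, first_pos=1021))
```

The two printed fragments should equal the query rows of the TCR alignments once the gap character is removed from the first one.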
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1206   1214  GGRAGGGGG
Binding Motif
Motif ID  Method  Source                   Motif file
MP00624   PBM     Transfer from PK22848.1  Download
Motif logo
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  -                   Retrieve
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_002953265.1  0.0      hypothetical protein VOLCADRAFT_94037
TrEMBL  D8U3R6          0.0      D8U3R6_VOLCA; Putative uncharacterized protein
Orthologous Group
Lineage       Orthologous Group ID  Taxa Number  Gene Number
Chlorophytae  OGCP3231530
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G22760.1  4e-29    Tesmin/TSO1-like CXC domain-containing protein